From Scientific Workflow Patterns to 5-star Linked Open Data
Authors
Abstract
Scientific workflow management systems have been widely adopted by data-intensive science communities. Many efforts have been dedicated to the representation and exploitation of provenance to improve reproducibility in data-intensive sciences. However, few works address the mining of provenance graphs to annotate the produced data with domain-specific context for better interpretation and sharing of results. In this paper, we propose PoeM, a lightweight framework for mining provenance in scientific workflows. PoeM produces linked in silico experiment reports from workflow runs. It leverages semantic web technologies and reference vocabularies (PROV-O, P-Plan) to generate provenance mining rules and then assemble linked scientific experiment reports (Micropublications, Experimental Factor Ontology). Preliminary experiments demonstrate that PoeM enables the querying and sharing of Galaxy-processed genomic data as 5-star linked datasets.
Similar resources
Towards Open Publication of Reusable Scientific Workflows: Abstractions, Standards and Linked Data
In recent years, a variety of systems have been developed that export the workflows executed to analyze data and make them part of published articles. We argue that the workflows that are published with current approaches are dependent on the specific codes used for execution, the specific workflow system used, and the specific workflow catalogs where they are published. In this paper, we descr...
Generating Conference Linked Open Data in One Click
In this paper we describe cLODg2 (conference Linked Open Data generator version 2), a tool to collect, refine and produce Linked Data about scientific conferences with their associated publications, participants and events. Conference metadata collected from different unstructured and semi-structured resources must be expressed with appropriate vocabularies to be exposed as Linked Data. cLODg2 ...
Sara Migliorini 1 , †
Introduction Scientific Workflow Management Systems (WfMSs) are software systems developed for automating scientific experiments that need to deal with huge amounts of data. The main goal of these systems is to facilitate the reuse and integration of domain specific functions and tools through a graphical environment. Scientific WfMSs can be used to automate repetitive error-prone activities, s...
Open Source Workflow: A Viable Direction for BPM?
With the growing interest in open source software in general and business process management and workflow systems in particular, it is worthwhile investigating the state of open source workflow management. The plethora of these offerings (recent surveys such as [4, 6], each contain more than 30 such systems) triggers the following two obvious questions: (1) how do these systems compare to each ...
A Visual Exploration Workflow as Enabler for the Exploitation of Linked Open Data
Semantically annotating and interlinking Open Data results in Linked Open Data which concisely and unambiguously describes a knowledge domain. However, the uptake of the Linked Data depends on its usefulness to non-Semantic Web experts. Failing to support data consumers to understand the added-value of Linked Data and possible exploitation opportunities could inhibit its diffusion. In this pape...